SMOR: A German Computational Morphology Covering Derivation, Composition and Inflection

نویسندگان

  • Helmut Schmid
  • Arne Fitschen
  • Ulrich Heid
چکیده

We present a morphological analyser for German inflection and word formation implemented in finite state technology. Unlike purely lexicon-based approaches, it can account for productive word formation like derivation and composition. The implementation is based on the Stuttgart Finite State Transducer Tools (SFST-Tools), a non-commercial FST platform. It is fast and achieves a high coverage.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Umlaut and Inflection in German

The present paper examines the prosodic constituent Foot as the domain of phonological phenomena in German. Several processes take place in this constituent, such as Glottal Stop Insertion and Final Devoicing, as well as the phenomena that are described below: productive umlaut and infinitive inflection. The status of the trochaic Foot as the unmarked constituent in German is also discussed. Th...

متن کامل

Morphological Generation of German for SMT

We participated in the ACL WMT 2009 shared task for translation of German to English, and English to German. We used the Moses open source system, combined with morphological processing. For German to English, we had the only constraint system comparable with the open-data systems. One of the reasons the system performed well was strong reduction of the German vocabulary, through a simplistic c...

متن کامل

Zmorge: A German Morphological Lexicon Extracted from Wiktionary

We describe a method to automatically extract a German lexicon from Wiktionary that is compatible with the finite-state morphological grammar SMOR. The main advantage of the resulting lexicon over existing lexica for SMOR is that it is open and permissively licensed. A recall-oriented evaluation shows that a morphological analyser built with our lexicon has comparable coverage compared to exist...

متن کامل

Learning Morphology of Romance, Germanic and Slavic Languages with the Tool Linguistica

In this paper we present preliminary work conducted on semi-automatic induction of inflectional paradigms from non annotated corpora using the open-source tool Linguistica (Goldsmith 2001) that can be utilized without any prior knowledge of the language. The aim is to induce morphology information from corpora such as to compare languages and foresee the difficulty to develop morphosyntactic le...

متن کامل

Syncretism without Underspecification: The Role of Leading Forms

The main goal of this article is to outline a new approach to syncretism in optimality theory, one that does not rely on the concept of underspecification taken over from grammatical theories which do not recognize constraint ranking and constraint violability. The analysis is based on a concept of morphological exponents as leading forms. Instances of syncretism can be traced back to the selec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004